Picture for Heyan Huang

Heyan Huang

DeepSurvey-Bench: Evaluating Academic Value of Automatically Generated Scientific Survey

Add code
Jan 13, 2026
Viaarxiv icon

Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation

Add code
Jan 12, 2026
Viaarxiv icon

MMWOZ: Building Multimodal Agent for Task-oriented Dialogue

Add code
Nov 16, 2025
Viaarxiv icon

PRIM: Towards Practical In-Image Multilingual Machine Translation

Add code
Sep 05, 2025
Figure 1 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Figure 2 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Figure 3 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Figure 4 for PRIM: Towards Practical In-Image Multilingual Machine Translation
Viaarxiv icon

A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations

Add code
Jun 06, 2025
Viaarxiv icon

DocMEdit: Towards Document-Level Model Editing

Add code
May 26, 2025
Viaarxiv icon

T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation

Add code
May 23, 2025
Viaarxiv icon

EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scenarios

Add code
May 22, 2025
Viaarxiv icon

SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection

Add code
Mar 05, 2025
Viaarxiv icon

Consistent Client Simulation for Motivational Interviewing-based Counseling

Add code
Feb 05, 2025
Viaarxiv icon